Crowdsourcing Thumbnail Captions: Data Collection and Validation

نویسندگان

چکیده

Speech interfaces, such as personal assistants and screen readers, read image captions to users—but typically only one caption is available per image, which may not be adequate for all situations (e.g., browsing large quantities of images). Long provide a deeper understanding an but require more time listen to, whereas shorter allow thorough comprehension, yet have the advantage being faster consume. We explore how effectively collect both thumbnail captions—succinct descriptions meant consumed quickly—and comprehensive captions—which individuals understand visual content in greater detail; we consider text-based instructions time-constrained methods at these two levels detail find that method most effective collecting while preserving accuracy. Additionally, verify authors using this are still able focus on important regions by tracking their eye gaze. evaluate our collected along human-rated axes—correctness, fluency, amount detail, mentions concepts—and discuss potential model-based metrics perform large-scale automatic evaluations future.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Crowdsourcing Gaze Data Collection

Knowing where people look is a useful tool in many various image and video applications. However, traditional gaze tracking hardware is expensive and requires local study participants, so acquiring gaze location data from a large number of participants is very problematic. In this work we propose a crowdsourced method for acquisition of gaze direction data from a virtually unlimited number of p...

متن کامل

Microsoft COCO Captions: Data Collection and Evaluation Server

In this paper we describe the Microsoft COCO Caption dataset and evaluation server. When completed, the dataset will contain over one and a half million captions describing over 330,000 images. For the training and validation images, five independent human generated captions will be provided. To ensure consistency in evaluation of automatic caption generation algorithms, an evaluation server is...

متن کامل

Tour Planning for Crowdsourcing Sensor Data Collection

A disaster surveillance system that crowdsources observation data needs to compute a tour for each volunteer to follow during the data collection process. A common approach to do this computation is to first assign a subset of locations to each volunteer, and then solve the classical traveling salesman problem to find an optimal tour connecting locations in each subset. This paper describes the...

متن کامل

A TACRED Data Collection and Validation

TACRED leverages the work done selecting query entities and annotating system responses in the TAC KBP evaluations. In each year of the TAC KBP evaluation (2009–2015), 100 query entities are given to participating KBP systems with the aim of filling in valid knowledge base entries for these entities. Our annotation effort re-uses these query entities, annotating each sentence in the source corp...

متن کامل

Data collection, processing, validation, and verification.

The collection, processing, validation, verification, formatting, filing, and storage of the required input data are some of the most important components in the National Institute for Occupational Safety and Health (NIOSH) Radiation Dose Reconstruction Program. Without question, the quality and scientific validity of the reconstructed dose estimates are totally dependent on these aspects of th...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: ACM transactions on interactive intelligent systems

سال: 2023

ISSN: ['2160-6455', '2160-6463']

DOI: https://doi.org/10.1145/3589346